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Abstract. We present a definition of the class NP in combinato- 
rial context as the set of languages of structures defined by finitely 
many forbidden lifted substructures. We apply this to special syn- 
tactically defined subclasses and show how they correspond to nat- 
urally defined (and intensively studied) combinatorial problems. 
We show that some types of combinatorial problems like edge col- 
orings and graph decompositions express the full computational 
power of the class NP. We then characterize Constraint Satisfac- 
tion Problems (i.e. ii-coloring problems) which are expressible by 
finitely many forbidden lifted substructures. This greatly simplifies 
and generalizes the earlier attempts to characterize this problem. 
As a corollary of this approach we perhaps find a proper setting of 
Feder and Vardi analysis of CSP languages within the class MM- 
SNP. 

1. Introduction 

Think of a 3-colorability of a graph G = (V,E). This is a well 
known hard problem and there is a multiple evidence for this: concrete 
instances of the problem are difficult to solve (if you want a non-trivial 
example consider Kneser graphs; [21]), there is an abundance of mini- 
mal graphs which are not 3-colorable (these are called 4-critical graphs, 
see e.g. [H]) and in the full generality (and even for important "small" 
subclasses such as 4- regular graphs or planar graphs) the problem is a 
canonical NP-complete problem. 

Yet the problem has an easy formulation. A 3-coloring is simple to 
formulate even at the kindergarten level. This is in a sharp contrast 
with the usual definition of the class NP by means of polynomially 
bounded non-deterministic computations. Fagin [B] gave a concise de- 
scription of the class NP by means of logic: NP languages are just 
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languages accepted by an Existential Second Order (ESO) formula of 
the form 

3P*(S,P), 

where S is the set of input relations, P is a set of existential relations, 
the proof for the membership in the class, and ^ is a first-order for- 
mula without existential quantifiers. This definition of NP inspired a 
sequence of related investigations (see e.g. [5], [T31 ED] and these descrip- 
tive complexity results established that most major complexity classes 
can be characterized in terms of logical definability of finite structures. 
Particularly this led Feder and Vardi j7] to their seminal reduction of 
Constraint Satisfaction Problems (shortly CSP) to the so called MM- 
SNP (Monotone Monadic Strict Nondeterministic Polynomial) prob- 
lems which also nicely link MMSNP to the class NP in computational 
sense. This will be explained in some detail in Section 3 which presents 
one of the main motivations of this paper. Inspired by these results we 
would like to ask an even simpler question: 

Can one express the computational power of the class NP by combi- 
natorial means? 

From the combinatorial point of view there is a standard way how 
to approach (and sometimes to solve) a monotone property P: one 
investigates those structures without the property P which are critical, 
(or minimal) without P. One proceeds as follows: denote by T the class 
of all critical structures and define the class Forb(JF) of all structures 
which do not "contain" any F e T . The class Forb(JF) is the class of 
all structures not containing any of the critical substructures and thus 
it is easy to see that Forb(jF) coincides with the class of structures 
with the property P. Of course in most cases the class T is infinite 
yet a structural result about it may shed some light on property P. 
For example this is the case with 3-colorability of graphs where 4- 
critical graphs were (and are) studied thoroughly (historically mostly 
in relationship to the Four Color Conjecture). 

Of particular interest (and as the extremal case in our setting) are 
those monotone properties P of structures which can be described by 
finitely many forbidden substructures. It has been proved in a se- 
quence of papers [TJ [29] that a homomorphism monotone problem is 
First Order (shortly FO) definable if and only if it is positively FO 
definable (shortly FO+ definable), i.e. the formula does not contain 
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any negations (and so implications and inequalities), and thus alter- 
natively defined as Forb(JF) for a finite set T of structures. Although 
FO-definability is not a rare fact (and extremely useful in database the- 
ory), still FO-definability cannot express most combinatorial problems 
(compare [27], [1] which characterize all CSP which are FO-definable; 
see also Theorem [1]). Thus it may seem to be surprising that the classes 
of relational structures defined by ESO formulas (i.e. the whole class 
NP) corresponds exactly to those canonical lifts of structures which 
are defined by a finite set of forbidden substructures. Shortly, finitely 
many forbidden lifts determine any language in NP. This is being made 
precise in Section 3. Here, let us just briefly illustrate this by our ex- 
ample of 3-colorability. Instead of a graph G = (V, E) we consider the 
graph G together with three unary relations C\, C2, C3 which cover the 
vertex set V; this structure will be denoted by G' and called a lift of G 
(G' has one binary and three unary relations). There are 3 forbidden 
substructures or patterns: For each i = 1,2,3 the graph K 2 together 
with cover C$ = {1,2} and Cj = for j 7^ i form pattern F- (where 
the signature of F- contains one binary and three unary relations). The 
language of all 3-colorable graphs then corresponds just to the language 
$(Forb(F' 1 , F' 2 , F 3 )) where $ is the forgetful functor which transforms 
G' to G and the language of 3-colorable graphs is just the language of 
the class satisfying formula 3G'(G' e Forb(F' 1 , F' 2 , F 3 )). This extended 
language (of structures G') of course just expresses the membership of 
3-colorability to the class NP. There is more than this that meets the 
eye. This scheme fits nicely into the mainstream combinatorial and 
combinatorial complexity research. Building upon Feder-Vardi classifi- 
cation of MMSNP we isolate (in Theorems [5J [7J M) three computation- 
ally equivalent formulations of NP class: 

(1) By means of shadows of forbidden homomorphisms of relational 
lifts (the corresponding category is denoted by Rel cov (A, A')), 

(2) By means of shadows of forbidden injections (monomorphisms) 
of monadic lifts (the corresponding category will be denoted by 

(3) By means of shadows of forbidden full homomorphisms of monadic 
lifts (the corresponding category will be denoted by RelJ^ u (A, A')). 

Our results imply that each of these approaches includes the whole 
class NP. It is interesting to note how nicely these categories fit to 
the combinatorial common sense about the difficulty of problems: On 
the one side the problems in CSP correspond and generalize ordinary 
(vertex) coloring problems. One expects a dichotomy here: every CSP 
problem should be either polynomial or NP-complete (as conjectured 
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in [7J and probabilistically verified in [IB])- On the other side the above 
formulations (1), (2), (3) model the whole class NP and thus we can- 
not expect dichotomy there (by a celebrated result of Ladner [I?]). But 
this is in accordance with the combinatorial meaning of these classes: 
the formulation (1) expresses coloring of edges, triples etc. and thus it 
involves problems in Ramsey theory [9j [23] . The formulation (2) may 
express vertex coloring of classes with restricted degrees of vertices 
[T51 [TT] . The formulation (3) relates to vertex colorings with a given 
pattern among classes which appears in many graph decomposition 
techniques (for example in the solution of the Perfect Graph Conjec- 
ture [3]). The point of view of forbidden partitions (in the language of 
graphs and matrices) is taken for example in [10] . This clear difference 
between combinatorial interpretations of syntactic restrictions on for- 
mulas expressing the computational power of NP is one of the pleasant 
consequences of our approach. 

At this point we should add one more remark. We of course do not 
only claim that every problem in NP can be polynomially reduced to 
a problem in one of these classes. This would only mean that each 
of these classes contains an NP-complete problem. What we claim is 
that these classes have the computational power of the whole of NP, 
i.e. these classes are computationally equivalent to all problems in NP. 
More precisely, to each language L in NP there exists a language M in 
any of these three classes such that M is polynomially equivalent to L, 
i.e. there exist polynomial reductions of L to M and M to L. 

Having finitely many forbidden patterns (i.e. forbidden substruc- 
tures) for a class of structures K we are naturally led to the question 
whether /C is the class determined by a finite set of templates, or in 
other words by the existence of homomorphisms to particular struc- 
tures. In technical terms (see e.g. [HIE]) this amounts to the question 
whether /C is an instance of a Constraint Satisfaction Problem (shortly 
CSP). On the other hand finitely many forbidden patterns lead to the 
question whether the class /C is not defined by a finite duality. This 
scheme for combinatorial problems goes back to [23], see e.g. [11] and it 
was studied in situations as diverse as bounded tree width dualities |12j . 
duality of linear programming [TT] and classes with bounded expansion 
|25j . Here we completely characterize (using results of [27]) shadows 
of finitary dualities in the case where the extension of the language 
is monadic, i.e. it consists of unary relations (as is the above case of 
3-coloring), see Theorem [131 These general results can be used in the 
investigation of the class MMSNP (to be defined in Section 3). Feder 
and Vardi introduced this class as a fragment of SNP in [7J. They 
proved that the class MMSNP is randomly polynomially equivalent to 



FORBIDDEN LIFTS (NP AND CSP FOR COMBINATORISTS) 5 

the class of finite union of CSP languages. This was later derandom- 
ized by the first author proving that the classes MMSNP and CSP are 
computationally equivalent [16]. We will examine these classes from 
the viewpoint of descriptive complexity theory: Any finite union of 
CSP languages belongs to MMSNP. But the converse does not hold. 
Consider for example the language of triangle free graphs: this is an 
MMSNP language which is not a finite union of CSP languages. Made- 
laine and Stewart introduced the class of Forbidden Pattern Problems 
(FP) as an equivalent combinatorial version of MMSNP [2Uj, [IS]- They 
gave an effective, yet lengthy process to decide whether an MMSNP 
language is a CSP language. We give a short and easy procedure to de- 
cide whether an MMSNP language is a finite union of CSP languages, 
and we show that these are exactly those languages defined by forbid- 
den patterns not containing any cycle. This simplicity is possible by 
translation and generalization of the Feder-Vardi proof of the computa- 
tional equivalence of finite union of CSP's and MMSNP in the context 
of category theoretical language of duality. 

The paper is organized as follows: In Section 2 we review the basic 
notions and previous work related to finite structures. Particularly we 
state two our basic tools: the characterization of finite dualities [23 E] 
and a combinatorial classique, the sparse incomparability lemma. It is 
here where we introduce two our basic notions of lifts and shadows. 
The interplay of corresponding classes (categories) is a central theme 
of this paper. In Section 3 we introduce the relevant notions of descrip- 
tive complexity (mostly taken from [7]) and relate it to our approach. 
We prove that the class NP is polynomially equivalent with classes of 
structures characterized by finitely many forbidden lifts (this is proved 
in three different categories, see Theorems 0, El and [S]) • In Section 4 we 
study the relationship of lifts and shadows abstractly from the point of 
view of dualities. Theorem [13] enables us to prove the characterization 
of shadows of finite dualities (called lifted dualities) in lifts and shad- 
ows. This, as a corollary, proves the main result of [20J. In Section 5 we 
return to Feder-Vardi setting and indicate how the polynomial equiv- 
alence of classes MMSNP and finite unions of CSP problems emerges 
naturally in our combinatorial-categorical context. 

For more complicated (i.e. nonmonadic) lifts we (of course) have 
partial results only. Perhaps the next case is that of covering equiv- 
alences. This we are still able to handle with our methods and we 
characterize all CSP languages in this class. But we postpone this to 
another occasion. 
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2. Categories of Finite Structures 

For a relational symbol R and relational structure A let A = X(A) 
denote the universe of A and let R(A) denote the relation set of tu- 
ples of A which belong to R. Let A denote the signature (type) of 
relational symbols, and let Rel(A) denote the class of all relational 
structures with signature A. We will often work with two (fixed) sig- 
natures, A and A U A' (the signatures A and A' are always supposed 
to be disjoint). For convenience we denote structures in Rel(A) by 
A,B etc. and structures in Rel(A U A') by A', B' etc. We shall de- 
note Rel(A U A') by Rel(A, A'). The classes Rel(A) and Rel(A, A') 
will be considered as categories endowed with all homomorphisms. Re- 
call, that a homomorphism is a mapping which preserves all relations. 
Just to be explicit, for relational structures A, B G Rel(A) a mapping 
/ : X(A) — > X(B) is a homomorphism A — > B if for every rela- 
tional symbol R G A and for every tuple (x\, . . . , xt) G R(A) we have 
(f(xi),...,f(x t )) G R(B). More generally we will use the notation 
A — > B for morphisms when working in other categories. (These 
will be categories of relational structures, where the morphisms will 
be either the injective or the full homomorphism, respectively.) Simi- 
larly we define homomorphisms for the class Rel(A, A'). The interplay 
of categories i?e/(A,A') and Rel(A) is one of the central themes of 
this paper. Towards this end we define the following notions: Let 
<3> : Rel(A, A') — > Rel(A) denote the natural forgetful functor that 
"forgets" relations in A'. Explicitly, for a structure A' G Rel(A,A') 
we denote by $(A') the corresponding structure A G Rel(A) defined 
by X(A') = X(A), R(A') = R(A) for every R G A. For a homomor- 
phism / : A' — > B' we put $(/) = /. The mapping / is of course also 
a homomorphism $(A') — ► $(B'). This is expressed by the following 
diagram. 




FORBIDDEN LIFTS (NP AND CSP FOR COMBINATORISTS) 7 

These object-transformations call for a special terminology: For 
A' G Rel(A,A') we call $(A') = A the shadow of A'. Any A' with 
$(A') = A is called a lift of A. The analogous terminology is used 
for subclasses C of Rel(A,A') and Rel(A). (Thus, for example, for a 
subclass C C Rel(A, A'), $(C) is the class of all shadows of all struc- 
tures in the class C.) The following special subclass of Rel(A, A') will 
be important: denote by Rel cov (A, A') the class of all structures in 
Rel(A,A') where we assume that all relations in A' have the same 
arity, say r, and that all the r-tuples of an object are contained by 
some relation in A'. The category Rel cov (A, A') is briefly called cover- 
ing or r-covering category. Note that the class Rel cov (A, A') is closed 
under surjective homomorphisms. We will work with two other similar 
pairs of categories. We denote by Reli n j(A) and Relf u u{A) the cate- 
gories where the objects are again the relational structures of type A, 
but the morphisms are the injective and full homomorphisms, respec- 
tively. We call a mapping a full homomorphism if it is relation and 
non-relation preserving, too. Such mappings have very easy structure, 
as every full homomorphism which is onto is a retraction. We denote 
by Rel^(A, A') and RelJ^ u {A, A') the subclasses containing the same 
class of objects as Rel cov (A, A'). We only will use these notions in the 
case when A' contains monadic relations. 

Let T' be a finite set of structures in the category C (one of the above 
categories). By Forb(JF') we denote the class of all structures A' G C 
satisfying F' -f—^ A' for every F' G J 7 '. (This class is sometimes and 
perhaps more efficiently denoted by T' /».) Similarly (well, dually), 
for the finite set of structures V in C we denote by CSP(T>') the 
class of all structures A' G C satisfying A' — > D' for some D' G T>' . 
(This is sometimes denoted by — ► T>.) Now suppose that the classes 
Forb(jF') and CSP(V') are equal. Then we say that the pair (J 7 ', V) is 
a finite duality in C. Explicitly, a finite duality means that the following 
equivalence holds for every structure A' G C: 
F' -/-^ A' for every F' G T' iff A' — ► D' for some D' G V. 

One more definition is needed. In dualities (as well as in most of 
this paper) we are interested in the existence of a homomorphism (ev- 
ery CSP can be expressed by the existence of a homomorphism to a 
template; see [7],[TT]) . Consequently we can also use the language of 
partially ordered sets and consider the homomorphism order defined 
on the class of all structures with signature A: we define the order < 
by putting A < B iff there is a homomorphism A — > B. The ordering 
< is clearly a quasiorder but this becomes a partial order if we either 
factorize Ca by the homomorphism equivalence or, perhaps preferably, 
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if we restrict Ca to non-isomorphic core structures. We say that A 
is core if every homomorphism A — > A is an automorphism. Every 
finite structure A contains (up to an isomorphism) a uniquely deter- 
mined core substructure homomorphically equivalent to A, see [271 [TT] . 
The following result was recently proved in [8] as a generalization of 
[27] . It characterizes finite dualities of finite structures., i.e. in the 
category Rel(A). 

Theorem 1. For every signature A and for every finite set T of (re- 
lational) forests there exists (up to a homomorphism equivalence) a 
uniquely determined set T> of structures such that (JF, T>) forms a finite 
duality. Up to a homomorphism equivalence there are no other finite 
dualities. 

We did not define what is a forest (see [271 IB])- For the sake of 
completeness let us say that a forest is a structure not containing any 
cycle. And a cycle in a structure A is either a sequence of distinct 
points and distinct tuples x ,ri,xi, . . . ,r t ,x t = x where each tuple 
belongs to one of the relations -R(A) and each coordinate of r,- 

and r i+1 , or, in the degenerated case t = 1 a relational tuple with at 
least one multiple coordinate. The length of the cycle is the integer t in 
the first case and 1 in the second case. Finally the girth of a structure 
A is the shortest length of a cycle in A (if it exists; otherwise it is a 
forest). 

The study of homomorphism properties of structures not containing 
short cycles (i.e. with large girth) is a combinatorial problem studied 
intensively. The following result has proved particularly useful in vari- 
ous applications. It is often called the Sparse Incomparability Lemma: 

Lemma 2. Let k,£ be positive integers and let A be a structure. Then 
there exists a structure B with the following properties: 

(1) There exists a homomorphism f : B — > A; 

(2) For every structure C with at most k points the following holds: 
there exists a homomorphism A — > C if and only if there exists 
a homomorphism B — > C; 

(3) B has girth > I. 

B 
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This result was proved in [26j EE] (see also [TT]) by probabilistic 
methods. In fact in [2BII2B] it was proved for graphs only but the proof 
is the same for finite relational structures. Of particular interest in this 
context is the question whether there exists an explicit construction of 
the structure B. This is indeed possible: in the case of binary relations 
(digraphs) this was done in |22j and for general relational structures in 

m 

3. NP BY MEANS OF FINITELY MANY FORBIDDEN LIFTS 

There is a standard connection between formulae and existence of 
homomorphisms. This goes back to [2] and it can be formulated as 
follows: 

To every structure A in Rel(A) we associate the canonical conjunc- 
tive existential formula as the conjunction of the atoms Ra(x), 
where R G A preceded by existential quantification of all elements of 
A. Clearly this process may be reversed and thus there is a one-to-one 
correspondence between canonical conjunctive existential formulae and 
structures. It is then obvious that the following holds: 

There is a homomorphism A — > B if and only if B |= <^a- 
Following Fagin [6] , the class SNP consists of all problems expressible 
by an existential second-order formula with a universal first-order part. 
The class of problems expressible by an existential second-order formula 
is exactly the class NP when restricted to languages of finite structures. 
So the class SNP is computationally equivalent to NP. The input of any 
problem in SNP is a relational structure A of signature A with base 
set A = X(A) and II is a set of relations on the same base set A. In 
this situation it is customary to call the second order relations II proof. 
Let us be more specific (see [7]). Every language (problem) L in SNP 
may be equivalently described by a formula of the form 

3nVx/\-.(oi A Pi A Si), 

i 

where 

(1) a, is a conjunction of atoms or negated atoms involving vari- 
ables and input relations (i.e. of the form R(x) and ->R(x) for 
a relational symbol R and x a tuple of elements of X), 

(2) Pi is a conjunction of atoms and negated atoms involving vari- 
ables and existential (proof) relations (i.e. of the form P(x) 
and ->P(x) for P G II and x a tuple of elements of X) and 

(3) £j is the conjunction of atoms involving variables and inequali- 
ties (i.e. of form x ^ y). 
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A formula of this type is called a canonical formula of the language 
L in SNP. It will be denoted by tpi. 

Example: Consider the following language of digraphs (i.e. rela- 
tional structures, where the signature contains one single binary rela- 
tion E) defined by the following ESO formula: 

3Pi3P 2 Vxi, x 2 , x 3 , y f\ k ^ [{P k {x 1 )AP k (x 2 )AP k {x 3 ))A{E{x 1 , x 2 )AE(x 1 ,x 3 )A 
E(x 2 ,x 3 )) A (xi ^x 2 Ax 1 j£ x 3 A x 2 ^ x 3 )] A [~ | (~ | -Pi(y) A ->P 2 (y))\. 

This formula corresponds to the language of all binary relations 
whose base set can be covered by two sets in such a way that none 
of these sets contains linearly ordered set with 3 elements. If we in 
addition postulate that the relation E is symmetric then these are 
just graphs which can be vertex partitioned into two triangle free 
graphs. Following [7] one can also define three important syntactically 
restricted subclasses of SNP. 

We say that a canonical formula is monotone if there are no negations 
in the a^s. This implies that more relations lead to less satisfiable 
formulae. The canonical formula is monadic if the relations in IT are 
all monadic (which means that all proof relations are unary). The 
canonical formula is said to be without inequality if it can described by 
a canonical formula which does not contain £j. 

Feder and Vardi [7J have proved that the three subclasses of SNP 
defined by formulae with any two of these syntactical restrictions still 
have the full computational power of the class NP. 

Theorem 3. [7J 

(1) Every problem in NP has a polynomially equivalent problem in 
monotone SNP without inequality. Moreover, we may assume 
that the existential relations are at most binary. 

(2) Every problem in NP has a polynomially equivalent problem in 
monotone, monadic SNP. 

(3) Every problem in NP has a polynomially equivalent problem in 
monadic SNP without inequality. 

(The claim that we may restrict to binary relations in (1) is not 
stated in [7j but it is clear from the proof.) The class with all the 
three restrictions is denoted by MMSNP (Monotone Monadic Strict 
Nondeterministic Polynomial). We deal with this class in Section 6. 

In this paper we will reformulate Theorem [3] in our combinatorial 
category lift / shadow setting. This will be done in Theorem [5] for item 
(1), in Theorem [7J for item (2) and in Theorem [9] for item (3). First, we 
introduce the following: we say that the formula is primitive if for every 
clause (a>i A Pi A e^j , every variables Xi, . . . , x r occurring in it and every 
existential relation P E II of arity r either the atom P{x\, . . . ,x r ) or 
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its negation is an atom of the clause. We need the following technical 
lemma. 

Lemma 4. Every language in SNP can be described by a primitive 
formula. Moreover, if the original formula satisfies some of the restric- 
tions (i.e. if it is either monotone or monadic or without inequality) 
then so does the primitive formula. 

Proof. Consider the language L and the canonical formula defining L: 
3PVx f\ { -i {on A Pi A eA . We modify the formula so that for every proof 
relation R of arity r and variables Xi, . . . ,x r G S appearing in ai or $ 
either R(x\, . . . , x r ) or -iR(x\, . . . , x r ) is in the appropriate conjunct. 

In order to have such a formula we can replace -i (^cti A fliAeA by -i {on A 

PiAeiAR(xi, . . . , x r )) A-i(aiAj3iAeiA-iR{xi, . . . , x r )) , this is equivalent 
to the original formula. The repetition of this process will terminate 
in finitely many steps, and it gives an appropriate formula. □ 

Theorem 5. For every language L G NP there exist relational types 
A, A' and a finite set T 1 of structures in Rel cov (A, A') such that L is 
computationally equivalent to <§>{Forb(J-')). Moreover, we may assume 
that the relations in A' are at most binary. 

This theorem presents an equivalent form of item (1) of Theorem |3] 
by means of homomorphisms and classes Forfe(jF'). It is interesting 
to express other conditions (2), (3) of Theorem [3] by means of homo- 
morphisms and classes Forfo(jF'). These two other versions are stated 
below as Theorems [7] and 

Proof. Consider a language L and the canonical formula (fL (showing 
that it is monotone SNP without inequality). The construction of T' 
consists of two steps. In the first step we enforce technical conditions 
on the formula. 
Step 1. 

We need the technical assumption that all proof relations in II have 
the same (at most binary) arity and the formula is primitive. The first 
condition can be achieved by exchanging relational symbols not of max- 
imal arity by new relational symbols of maximal arity (binary would 
suffice). We can proceed as follows. In every clause of the formula we 
put new (free) different variables into the new entries in /3j, and we 
increase the number of variables in x, too. This new formula is equiv- 
alent to the original one. An evaluation satisfies to the new formula 
exactly iff its restriction to the original variables satisfies the original 
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formula. By Lemma H] we may also assume that the new formula is 
primitive. In the following we denote this formula by (p^. 

In the second step we define lifts. 
Step 2. 

The type A' will contain 2' n ' relational symbols corresponding to the 
2' n l possibilities for a subset of proof relations indicating possibilities 
in which a tuple can be. The pattern F- will correspond to the clause 
ai A Pi. The base set of each structure F[ is the set of variables in the 
clause «j A A tuple t of variables is in a relation R (of type A) if 
the atom R(i) appears in c^. Every tuple t in F- (of appropriate arity) 
is in exactly one relation from A', this is the relation corresponding 
to the subset of all existential relations P £ II such that the atom 
P(t) appears in Let T' be the set of all lifts F^. These may be 
disconnected, although we may work with their connected components, 
see Remark [6j 

We prove that for a structure A £ Rel(A) the formula ipi, is satisfi- 
able iff there is a lifted structure A' £ Rel(A, A') such that no F- £ T 1 
maps to A'. 

Suppose that A £ <&{Forb{T')), i.e. there is a lift A' £ Forb(P). 
We may suppose that every tuple of A' is in exactly one A' relation. 
This single A' relation corresponds to a subset of relational symbols 
in II. For every relational symbol P £ II define -P(A) to be the set 
of tuples t such that the relation in A' containing t corresponds to a 
subset containing P. Denote this structure of type (A, II) by A". We 
show that these relations prove A [= (p^. We have to prove that A" |= 
-i(a!j A Pi) holds for every clause. Consider the corresponding forbidden 
lift FV We know that F'* -> A', which yields A" |= -.(a* A 

Secondly suppose that A \= ip L . We can correspond to the proof 
relations on A a A' covering of the (binary) tuples in A, where every 
tuple is covered exactly once. This lift A' shows that A £ Q(Forb(T')). 
Consider a forbidden lift F', £ T' . We know that the proof relations 
of type II satisfy the formula -i(aj A Pi), hence F'j A'. □ 
Remark 6. Consider the languages Q(Forb{T')) and <&(Forb(G')). 
Their union is exactly the language <$>(Forb(H')), where TC' = {F'UG' : 
F' £ ForblJ 7 '), G' £ Forb(Q')}. Hence the languages of the form 
$(Forfe(jF')) are closed under union. In the proof of Theorem Owe may 
restrict ourselves to connected lifts when proving that the constructed 
$(For6(jF')) is the desired language. 

Let us now formulate and prove the two analogous theorems for the 
class monotone, monadic SNP and for the class monadic SNP without 
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inequality (which correspond to (2) and (3) of Theorem [3]) . Here we 
use the categories Rel%%(A, A') and Rel c f ™ u (A, A'). 
Theorem 7. For every language L e NP there exist relational types 
A and A', where A' contains only unary relational symbols and a finite 
set T 1 C Rel^(A, A') such that L is computationally equivalent to the 
class $(Forb i „ i (J' / )). 

Proof. We proceed analogously as in the proof of Theorem [5] for formu- 
las which are monotone monadic SNP. We stress the differences only. 
First, using Lemma H] again, we may suppose that L is defined by a 
canonical primitive formula. This constitutes the first step as now we 
do not have problem with the arity of the proof relations since these 
are all monadic. 
Step 2. 

We want to enforce for (aiAfa Ae») and distinct variables x, y appearing 
in it that x ^ y is an atom of £j. If this atom is not in fa then we 
exchange — i(ck» A fa A £j) by the following conjunction: ^(ct^ A fa x A 
£ h) A _, ( a «2 A fa 2 A£j 2 ), where -'(a!j 1 A A e i:l ) is -i(q;, A fa A £j) except 
that we replace all occurence of y by a; in it, a i2 = a^fa 2 = Pi and 
Ei 2 = Ei A ix ^ y). This new formula is equivalent to the original one. 
In finitely many steps we manage to enforce that all the required atoms 
of the form x ^ y are there in the appropriate £j. 

We now define A' in the same way as in Theorem [5] (thus A' is a 
monadic type). The set of forbidden lifts T' is also defined analogously 
as in Theorem [5] with the only one difference which relates to the con- 
struction of formula (p-p> which will have now more clauses: the formula 
ip F i will have all the atom clauses as in Theorem[S](i.e. v?p'(xi, . . . , £|f'|) 
will contain as atoms all those tuples which express the fact that a tu- 
ple a is in the homomorphic image of F') and additionally we will have 
atoms x y for every pair of different variables. After this change we 
see easily that the rest of the proof does not depend on which category 
we work. 

□ 

Remark 8. If we do not enforce the condition that the atom x ^ y 
appears in every clause containing the variables x and y (Step 1. of the 
proof) before constructing T' then we get some weaker characteriza- 
tion. Namely, the language L will be similar to the form of Theorem [7| 
but we allow partially injective mappings not only injective ones. For 
every F' G T' and pair x, y 6 F' we may have the plus condition that 
they can not collapse by a homomorphism. The class defined by such 
partially injective forbidden lifts still equals to the class of languages 
of the form $(Forbj n j(jF')): we can do Step 2. in this combinatorial 
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setting, too. Here the transformation means that for any F' £ T' and 
pair x, y £ F' which may collapse, we exchange F' by two new forbid- 
den structures. One of the structures is F' with conditions on the same 
pairs plus we require that x and y may not collapse. The other is a 
factor of F' where we identify x and y, and we have the condition on 
a pair of elements of new structures not to collapse iff we have it on 
a pair in their preimages in F'. The iteration of this transformation 
expresses a language defined in partially injective setting in the fully 
injective terminology of Theorem [7] (with the same A and A'). 

Let us now transform the third syntactic class of Theorem [3j 
Theorem 9. For every language L £ NP there exist relational types 
A and A' , where A' contains only unary relational symbols and a finite 
set T' C Relf^ u (A, A') such that L is computationally equivalent to the 
class $(Forb /u «(.T))- 

Proof. The proof of Theorem M is again a modification of the above 
proof of Theorem [5] (and of Theorem [7J for formulas in monadic SNP 
without inequality. The construction of T' is even easier: Again, in 
Step 1., it suffices to assume that tpi is canonical primitive. We only 
need to be careful with construction of the formula <pf'(xi, • • • , £|F'|) ex- 
pressing the fact that the set {xi, . . . ,x\f'\} is the homomorphic image 
of F' (recall that all homomorphism are now considered in Relf u u(A, A')) 
The formula will contain again more atoms. For every tuple a in the 
input relation R we will have an atom expressing that the image of the 
tuple is in relation R like in the proof of Theorem [51 But additionally 
we will have the negation of such an atom for every tuple not contained 
by an input relation. The rest of the proof is again the same. □ 

Similarly as above (Remark [8] to Theorem [7J we have the possibility 
to state a weaker theorem in the notion of partially full mappings. 
Consider a relational symbol R of arity q. We may have two conditions 
on an g-tuple in a structure A, either it is in R or not. In the category 
Rdfuii this gives some restrictions on the homomorphisms of A in 
both cases. We may generalize the class of objects such that for every 
relation R and g-tuple we have three possibilities (from the viewpoint of 
mappings to a structure): either the tuple should be mapped to a tuple 
in R, or to a tuple not in R or we have no restriction. We may define a 
class of languages in Rel(A) using this enlarged set of forbidden lifts. 
However this new class of languages is still equal to those of the form 
$(Forb/ u //(jF')). This may be seen as follows: a forbidden lift in this 
new setting may be replaced by a set of forbidden lift in RelJ^ u (A, A') 
as for the set of tuple-relation pairs without any condition we take all 
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possibilities of relation and non-relation conditions. This new set of 
forbidden lifts defines the same language. 

4. Lifts and Shadows of Dualities 

Some of the transformations presented in Section 4 lead to deeper 
results - the lifts and shadows give rise to a life on their own. We prove 
here two results which will prove to be useful in the next section. 

It follows from the Section 3 that shadows of classes Forb(jF') (in 
three categories Rel cov (A, A'), iteZg$(A,A') and Relf" u (A, A')) in- 
clude all NP-complete languages. What about finite dualities? A deli- 
cate interplay of lifting and shadows for dualities is expressed by the fol- 
lowing two statements which deal (for brevity) with classes Rel cov (A, A') 
only. Despite its formal complexity Theorem [TD] is an easy statement. 

Theorem 10. Let T' be a finite set of structures in Rel cov (A, A'). 
Suppose that there exist a finite set of structures V such that (J 7 ', V) 
is a finite duality in Rel cov (A, A'). Then the following sets coincide: the 
shadow $(Forb(.T)) = {$( A ') : A ' e Forb(.T)} and CSP{§{V')) = 
Vd'gC CSP($(D')). Explicitly: for every A G Rel(A) there exists 
A' G Rel(A, A'), $(A') = A and F' -f-^ A' for every F' G T' iff 
A — ► $(D') for some D' G V . 

Note that (in the statement of Theorem [TU] we do not claim that 
the pair ($(JF'), $(£>')) is a duality in the class Rel(A). This of course 
does not hold (as shown by our example of 3-colorability in the intro- 
duction). But the images of all structures defined by all obstacles of 
CSP(V) are forming all obstacles of CSP($(W)). We call this shadow 
duality. 

It is important that Theorem [TOl may be sometimes reversed: shadow 
dualities may be sometimes "lifted". This is non-trivial and in fact 
Theorem [11] may be seen as the core of this paper. 
Theorem 11. Let T' be a finite set of structures in Rel cov (A, A'), con- 
sider Forb(J zr/ ) and suppose that $(Forb(J zr ')) = CSP{V) (in Rel(A)) 
for a finite set T> of objects of Rel(A). (In other words let the pair 
[T ] ,V) form a shadow duality.) Assume also that CSP(V) ^ Rel(A) 
and that A' contains only unary relations. Then there exists a finite 
set V in Rel cov (A, A') such that Forb(.F) = CSP(V). 

Before proving Theorems [TD] and [TT] we formulate first a lemma which 
we shall use repeatedly: 

Lemma 12. (lifting) Let A, B G Rel(A), homomorphism f : A — > 
B and $(B') = B be given. Then there exists A' G Rel cov {A, A'), 
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$(A') = A such that the mapping f is a homomorphism A' — > B' in 
Rel cov (A, A'). 



A' 



f 



T 

A 



B 



Proof. Assume that A, B G Rel(A), $(B') = B and / : A — > B are 
as in the statement. For each R G A' put R{A!) = f~ 1 (R(B')). It is 
easy to see that A' G Rel cov (A, A') □ 

Proof, (of Theorem Mj Suppose that A G CSP(<S>{V')), say A G 
CSP($(D')). Now for a homomorphism / : A — ► $(D') there is 
at least one lift A' of A such that the mapping / is a homomor- 
phism A' — > D' (here we use Lifting Lemma [T2l) . By the duality 
(J 7 ', V) (in Rel cov (A, A')) F' A' for any F' G X>' and thus in turn 

A G $(Forb(.F))- 

Conversely, let us assume that A' G Forb(JF') satisfies $(A') = A. 
But then A' G CSP(V) and thus by the functoral property of $ we 
have A = $(A') G CSP($(V')). 

□ 

Proo/. (of Theorem HI]) Assume $(Forb(^')) = CSP(V). Our goal is 
to find X)' such that Forb(JF') = CSP(T>'). This will follow as a (non- 
trivial) combination of Theorem [1] and Lemma [21 By Theorem [1] we 
know that if T' is a set of (relational) forests then the set T' has a dual 
set X?' (in the class Rel cov (A, A')). So assume to the contrary that one 
of the structures, say Fq, fails to be a forest (i.e. we assume that one of 
the components of Fg has a cycle). We proceed by a refined induction 
(which will allow us to use more properties of F ). Let us introduce 
carefully the setting of the induction. 

We assume shadow duality $(Forb(J^')) = CSP(V). Let V be fixed 
throughout the proof. Clearly many sets T' will do the job and we 
select the set T' such that T' consists of cores of all homomorphic 
images (explicitly: we close T' on homomorphic images and then take 
the set of cores of all these structures). Among all such sets T' we 
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take a set of minimal cardinality. It will be again denoted by J-'. We 
proceed by induction on the size \T'\ of J 7 '. 

The set Forb(jF') is clearly determined by the minimal elements of T' 
(minimal in the homomorphism order). Thus let us assume that one of 
these minimal elements, say F , is not a forest. By the minimality of T' 
we see that we have a proper inclusion $(Forb(jF'\ {Fq})) D CSP{V). 
Thus there exists a structure S in the difference. But this in turn means 
that there has to be a lift S' of S such that F — ► S' and S -/-> D for 
every D G T>. In fact not only that: as Fq is a core, as Forb(jF') is 
homomorphism closed and as T' has minimal size we conclude that 
there exist S and S' such that any homomorphism F' — > S' is a 
monomorphism (i.e. one-to-one, otherwise we could replace F by a 
set of all its homomorphic images - Fq would not be needed). 

Now we apply (the second non-trivial ingredient) Lemma [2] to the 
structure S and an £ > \X(F' )\: we find a structure So with the 
following properties: So — > S, So — ► D if and only if S — ► D for 
every Del 1 and So contains no cycles of length < i. It follows that 
S ^ CSP(V). Next we apply Lemma [T21 to obtain a structure S with 
S — > S'. Now we use that all relations in A' are unary and we see that 
S does not contain cycles of length < I. Now for any F' G J 7 ', F' 7^ F 
we have F' S as S — > S' and F' -/» S'. As the only homomorphism 
F — > S' is a monomorphism the only (hypothetical) homomorphism 
F — > S' is also monomorphism. But this is a contradiction as F' 
contains a cycle while S' has no cycles of length < £. This completes 
the proof. □ 



5. MMSNP AND FORBIDDEN PATTERNS 

Madelaine [19] introduced the class FP. Every language of the class 
FP is defined by forbidden patterns which are defined as follows. Con- 
sider the finite relational type A, the finite set T and the set of pairs 
(Fi, <fx), . . . , (F n , (p n ), where each Fj G Rel(A) and ipi : Fj — > T is a 
mapping (i = l,...,n). The language L belongs to the class FP if 
there are patterns (F 1; (fx), . . . , (F n , (p n ) such that L is the class of all 
structures A G Rel(A) for which there exists a mapping (p : A — > T 
such that for all i — 1, . . . , n no homomorphism a : Fj —>■ A satisfies 
ipoa 7^ (fi. Formally: L = {A G Rel(A) : 3ip : A — >• T such that Vi, a : 
Fj — » A homomorphism ip o a 7^ ifii}. 

This is a special case of our approach and the class FP may be 
equivalently defined as follows (using lifts and shadows): we say that 
the set L C Rel(A) is an FP-language if there exist a finite type A' of 
monadic (unary) relational symbols and a language V G Rel cov (A, A') 



18 



GABOR KUN AND JAROSLAV NESETRIL 



such that L = $(Forb(F')) for a finite set F Q Rel cov (A, A'). (Thus 
A' is a partition on every F' G J 7 '.) The equivalence is clear: we con- 
sider the signature (relational type) A' that contains the unary symbol 
u t for every element t E T. To every pattern (Fj, <fi) we correspond the 
relational structure F' { G Rel(A, A') with the shadow Fj such that the 
element x G Fj is in the relation u^m. The converse is also evident: 
every FP-language can be defined by forbidden patterns. 

In other words the class FP is the class of languages defined by 
forbidden monadic lifts of the class Rel(A). 



It has been proved in [T5] that the classes FP and MMSNP are equal. 

The containment FP D MMSNP follows from the proof of Theorem 
every MMSNP problem (as any NP problem) can be considered as 
the shadow of a language Forb(jF') in a lifted category Rel cov (A, A'). 
In the case of the class MMSNP these lifted relations (in A') are all 
unary. And for unary relations we use the preceding remark which 
claims that monadic lifts and forbidden patterns are equivalent. In 
order to prove the converse one needs to show that every language 
defined by forbidden monadic lifts is in MMSNP. This part of proof is 
st raight forward . 

Madelaine and Stewart [20] gave a long process to decide whether an 
FP language is a finite union of CSP languages. We use Theorems [TOj 
[Til and the description of dualities for relational structures [8] to give 
a short characterization of a more general class of languages. 

Theorem 13. Consider the language L determined by forbidden monadic 
lifts. Explicitly, L = <&{Forb{T')) for a set T' C Rel(A,A') (with A' 
monadic). If no F' G For6(jF') contains a cycle then there is a set of 
finite structures V C Rel(A) such that L = CSP(V). If one of the 
lifts F' in a minimal subfamily of T' contains a cycle in its core then 
the language L is not a finite union of CSP languages. 

Proof. If no F' G Forb^J 7 ') contains a cycle then the set T' has a dual 
V in Rel cov (A, A') by [8], and the shadow of this set V gives the dual 
set T> of the set $(For6(jF')) (by Theorem [TUT) . On the other side if one 
F' G Forb(T') contains a cycle in its core and if T' is minimal (i.e. F' 
is needed) then Forfe(jF') does not have a dual in Rel cov (A, A'). The 
shadow of the language Forb^J 7 ') is the language L and consequently 
this fails to be a finite union of CSP languages by Theorem [11] (as every 
monadic shadow duality can be lifted). □ 



Now we prove one of the principal results of [7] by tools which we 
developed in previous sections. Feder and Vardi have proved that the 
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classes MMSNP and CSP are random equivalent, this was later de- 
randomised. Here we discuss the deterministic part of the Feder-Vardi 
proof. It seems that our setting streamlines some of the earlier argu- 
ments. Our proof is not essentially different from the original one, yet 
the use of dualities makes the construction of the proof natural and 
easier. 

A structure A is biconnected if every point deleted substructure is 
connected (in other words for every three distinct elements x, y and 
z there is a path connecting x and y that avoids z). Note that a 
biconnected structure with more than one relational tuple contains a 
cycle. Inclusion maximal biconnected substructures are called bicon- 
nected components (in graph theory they are called blocks). For the 
set of relational structures T> we denote by C S P g irth>k(T3) the language 
of structures in CSP(T>) with girth larger than k. We will prove the 
following: 

Theorem 14. [7j For every MMSNP language L there is a finite set of 
relational structures T> (of possibly different type) and a positive integer 
k such that the following hold. 

(1) L can be polynomially reduced to CSP(T>). 

(2) The language CSP g i r th>k{T^) can be polynomially reduced to L. 

Proof. We assume that L = §(Forb(F)) for a set T' C Rel(A, A') 
(with A' monadic). We construct the set T>. First we determine the 
type of the relational structures in T>. 

Let B C -Re/(A,A') be the set of the biconnected components of 
the structures in T' . For every isomorphism class in $(£>) we choose 
one structure B in this class. We may assume that the base set of the 
representative B is {1,...,|B|}. For each B 6 $(£>) we introduce the 
relational symbol R& of arity |B| (the size of the structure). We denote 
by (3 the type that consists of these relational symbols R&- This will 
be the type of the structures in V. 

Next, we define the following functors \1/ and O. The functor \I/ : 
Rel(A) — > Rel(j3) assigns to a structure A a structure ^(A). A and 
^(A) have the same base set and its relatons are defined as follows: 

-Rb^A)) = {f\f : B — >• A} (/ is a homomorphism in Rel(A)). 

I.e. a tuple of elements is in R& relation if it is the homomorphic 
image of B. The functor 9 maps to a structure A e Rel{(5) the 
following structure again on the same base set in Rel(A): 

0(A) = U{/(B) : B G $(£),/ e R B (A)} (here /(B) is the homo- 
morphic image of the structure B). 
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The mappings ^ and are both functoral. Consider the induced 
functors : Rd(/3, A') -> iteZ(A, A') and 6' : i?e/(A, A') -> i?e/(/3, A') 
We will use the following properties of these functors. 

(i) 6 o * = idifcjfA) and 6' o = id Rd{AA/) 

(ii) For every B G Rel(/3) (B' G Rel(/3, A')) and relational symbol 
i? G /? the following inclusions hold: 

o 0(B)) D i?(B) o e'(B')) 2 i?(B')). 

We continue the construction of P. We define the finite set of struc- 
tures Q' C Rel(p, A') as follows: We put G' G 0' if 

(1) G' is a forest, 

(2) there exists F' G T' and a homomorphism 97 : F' — > O(G'), 
such that every element of G' is contained in <£>(F') or in a 
relational tuple intersecting <p(F'). 

Observe the following straightforward consequences of the construc- 
tion of Q' . 

(iii) If A' g Forb(T') then (A') ^ Forb{Q') holds for every A' G 
Rel(A, A'), since A' G J 7 ' =>- tf(A') £ Forb{g'). 

(iv) If B' £ Forb{g') then O(B') £ Forb(F) holds for every B' G 
Rel(P, A'), since B' G 0' 0(B') £ Forb{F'). 

The set consists of finitely many relational forests. Hence we know 
by TheoremGDthat $(For&(£')) = CSP(V) for some finite V C i?e/(/5). 
We will prove that the conditions of the theorem hold for this choice 
of T>. All the reductions will be functoral. 

First we prove that L can be polynomially reduced to <&(Forb(G')). 
We succeed to show that for a structure A G Rel(A) the equivalence 
A G L -<=>- ^(A) G <&(Forb(G')) holds. This is implied by the equiv- 
alence in the lifted category as the same A' relations prove the mem- 
bership in both languages: If A' G Forb(F') then ^'(A') G Forb(Q') 
by (i) and (iv). On the other hand (iii) implies that if A' ^ Forb(T') 
then tf'(A') £ Forb(g'). 

Let k denote the size of the largest structure in T' . We prove 
that CSP g i Tt h>k{T^) can be polynomially reduced to L. In fact we 
will prove that for every B G Rel(/3) with girth > k the equiva- 
lence B G <&(Forb(Q')) -<=>- 0(B) G L holds. Again we prove 
the equivalence in the lifted categories. If O'(B') G Forb(Q') then 
^'(©'(B')) G For6(jF'), as we have seen in the reduction of L to K. 
The structure B' contains less relations than \l/'(0'(B')) by (ii), hence 
B' G Forb(Q'). If O'(B') ^ Forb(Q') then there exists a structure 
F' G JF' such that </? : F' — > B'. By the girth condition on B' we 
know that the union of <f(F') and the relational tuples intersecting 
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(p(F') is a forest. Hence there is a structure G' G Q' isomorphic to this 
substructure of B'. Now G' — > A', hence A' ^ Forb(Q'). □ 

The remaining part is the reduction of CSP with large girth to CSP. 
Feder and Vardi proved a randomized reduction, this was later deran- 
domized. 

Lemma 15 ([16]). For every finite set of relational structures T> and 
integer k > the language CSP(T>) can be polynomially reduced to 

C S Pgi r th>k(T)) ■ 

The essence of this reduction is the Sparse Incomparability Lemma 
[3 This polynomial reduction was proved with expanders in the case of 
digraphs [22J. The reduction in the case of general relational structures 
needed a generalization of expanders called expander (relational) struc- 
tures. The notion of expander relational structures was introduced in 
[T6] [15] , and also a polynomial time construction of such structures 
with large girth is given there. 
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